On Fitting Mixture Models
Authors
Abstract
Consider the problem of fitting a finite Gaussian mixture, with an unknown number of components, to observed data. This paper proposes a new minimum description length (MDL) type criterion, termed MMDL (for mixture MDL), to select the number of components of the model. MMDL is based on the identification of an "equivalent sample size" for each component, which does not coincide with the full sample size. We also introduce an algorithm based on the standard expectation-maximization (EM) approach together with a new agglomerative step, called agglomerative EM (AEM). The experiments reported here show that MMDL outperforms existing criteria of comparable computational cost. The good behavior of AEM, namely its robustness with respect to initialization, is also illustrated experimentally.
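To make the model-selection setting concrete, here is a minimal sketch of choosing the number of components with plain EM and the standard MDL/BIC-style penalty that uses the full sample size. It is not the MMDL criterion or the AEM algorithm from the abstract, whose details are not given here. The function names (`fit_em`, `mdl_score`, `select_k`), the quantile initialization, and the variance floor are assumptions made for this illustration, and the data are one-dimensional for brevity.

```python
import math
import random


def log_normal(x, mean, var):
    """Log-density of a 1-D Gaussian."""
    return -0.5 * (math.log(2 * math.pi * var) + (x - mean) ** 2 / var)


def e_step(data, weights, means, variances):
    """Return responsibilities and the log-likelihood (log-sum-exp for stability)."""
    resp, loglik = [], 0.0
    for x in data:
        logs = [math.log(w) + log_normal(x, m, v)
                for w, m, v in zip(weights, means, variances)]
        top = max(logs)
        log_total = top + math.log(sum(math.exp(l - top) for l in logs))
        loglik += log_total
        resp.append([math.exp(l - log_total) for l in logs])
    return resp, loglik


def fit_em(data, k, iters=200):
    """Fit a k-component 1-D Gaussian mixture with plain EM; return the log-likelihood."""
    n = len(data)
    ordered = sorted(data)
    mean_all = sum(data) / n
    var_all = sum((x - mean_all) ** 2 for x in data) / n
    floor = 1e-6 * var_all + 1e-12  # assumed variance floor to avoid collapse
    # Assumed initialization: means at evenly spaced sample quantiles.
    means = [ordered[int((j + 0.5) * n / k)] for j in range(k)]
    variances = [max(var_all, floor)] * k
    weights = [1.0 / k] * k
    for _ in range(iters):
        resp, _ = e_step(data, weights, means, variances)
        for j in range(k):  # M-step: weighted updates per component
            nk = max(sum(r[j] for r in resp), 1e-12)
            weights[j] = nk / n
            means[j] = sum(r[j] * x for r, x in zip(resp, data)) / nk
            spread = sum(r[j] * (x - means[j]) ** 2 for r, x in zip(resp, data)) / nk
            variances[j] = max(spread, floor)
    _, loglik = e_step(data, weights, means, variances)
    return loglik


def mdl_score(data, k):
    """Standard MDL/BIC-style score: -log-likelihood + (free parameters / 2) * log n."""
    n_params = 3 * k - 1  # k means, k variances, k - 1 free mixing weights
    return -fit_em(data, k) + 0.5 * n_params * math.log(len(data))


def select_k(data, max_k):
    """Score k = 1..max_k and return the minimizing k with all scores."""
    scores = {k: mdl_score(data, k) for k in range(1, max_k + 1)}
    return min(scores, key=scores.get), scores


if __name__ == "__main__":
    rng = random.Random(0)
    sample = ([rng.gauss(-5.0, 1.0) for _ in range(150)]
              + [rng.gauss(5.0, 1.0) for _ in range(150)])
    best, scores = select_k(sample, max_k=4)
    print(best, scores)
```

In this sketch every component is penalized with the logarithm of the full sample size; the abstract's point is that MMDL instead identifies a per-component equivalent sample size that differs from it.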
Similar resources
The Family of Scale-Mixture of Skew-Normal Distributions and Its Application in Bayesian Nonlinear Regression Models
Previous studies on fitting nonlinear regression models with a symmetric structure usually assume normality in the analysis of the data. This choice may be inappropriate when the distribution of the residual terms is asymmetric. Recently, the family of scale mixtures of skew-normal distributions has become a main concern of many researchers. This family includes several skewed and heavy-tailed d...
On some Variants of the EM Algorithm for the Fitting of Finite Mixture Models
Finite mixture models are being increasingly used in statistical inference and to provide a model-based approach to cluster analysis. Mixture models can be fitted to independent data in a straightforward manner via the expectation-maximization (EM) algorithm. In this paper, we look at ways of speeding up the fitting of normal mixture models by using variants of the EM, including the so-called s...
Bayesian Curve Fitting Using Multivariate Normal Mixtures
Problems of regression smoothing and curve fitting are addressed via predictive inference in a flexible class of mixture models. Multidimensional density estimation using Dirichlet mixture models provides the theoretical basis for semi-parametric regression methods in which fitted regression functions may be deduced as means of conditional predictive distributions. These Bayesian regression fun...
Latent class representation of the Grade of Membership model
Latent class and the Grade of Membership (GoM) models are two examples of latent structure models. Latent class models are discrete mixture models. The GoM model was originally developed as an extension of latent class models to a continuous mixture. This note describes a constrained latent class model which is equivalent to the GoM model, and provides a detailed proof of this equivalence....
Fitting Multivariate Normal Finite Mixtures Subject to Structural Equation Modeling
This paper is about fitting multivariate normal mixture distributions subject to structural equation modeling. The general model comprises common factor and structural regression models. The introduction of covariance and mean structure models reduces the number of parameters to be estimated in fitting the mixture and enables one to investigate a variety of substantive hypotheses concerning the...
MIXFIT: an algorithm for the automatic fitting and testing of normal mixture models
We consider the fitting of normal mixture models to multivariate data, using maximum likelihood via the EM algorithm. This approach requires the specification of an initial estimate of the vector of unknown parameters, or equivalently, of an initial classification of the data with respect to the components of the mixture model under fit. We describe an algorithm called MIXFIT that automatically ...
Journal title:
Volume, issue:
Pages: -
Publication date: 1999